Overview

Dataset statistics

Number of variables: 26
Number of observations: 235795
Missing cells: 0
Missing cells (%): 0.0%
Duplicate rows: 0
Duplicate rows (%): 0.0%
Total size in memory: 46.8 MiB
Average record size in memory: 208.0 B
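
The memory figures above are consistent with each other, which can be checked with a few lines of arithmetic. The sketch below only uses the numbers printed in this block; the constant and function names are illustrative, and the reading that each value occupies 8 bytes (for example a 64-bit number or an object pointer) is an assumption rather than something the report states.

```python
# Values copied from the "Dataset statistics" block of this report.
N_VARIABLES = 26
N_OBSERVATIONS = 235_795
AVG_RECORD_BYTES = 208.0

BYTES_PER_MIB = 1024 ** 2


def total_size_mib(n_rows: int, bytes_per_row: float) -> float:
    """Total in-memory size in MiB for n_rows records of bytes_per_row each."""
    return n_rows * bytes_per_row / BYTES_PER_MIB


# Average bytes per cell; 8.0 would match 64-bit values (assumption).
bytes_per_value = AVG_RECORD_BYTES / N_VARIABLES
total_mib = total_size_mib(N_OBSERVATIONS, AVG_RECORD_BYTES)

print(f"{bytes_per_value:.1f} B per value, {total_mib:.1f} MiB in total")
```

The same arithmetic applied to a single column (235795 values of 8 bytes each) gives roughly 1.8 MiB, the "Memory size" reported for each variable.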

Variable types

Numeric: 21
Categorical: 5

Warnings

PLAYER_NAME has a high cardinality: 1433 distinct values High cardinality
MIN has a high cardinality: 3200 distinct values High cardinality
FGM is highly correlated with FGA and 1 other field High correlation
FGA is highly correlated with FGM and 1 other field High correlation
FG3M is highly correlated with FG3A and 1 other field High correlation
FG3A is highly correlated with FG3M High correlation
FG3_PCT is highly correlated with FG3M High correlation
FTM is highly correlated with FTA and 2 other fields High correlation
FTA is highly correlated with FTM and 2 other fields High correlation
FT_PCT is highly correlated with FTM and 1 other field High correlation
OREB is highly correlated with REB High correlation
DREB is highly correlated with REB High correlation
REB is highly correlated with OREB and 1 other field High correlation
PTS is highly correlated with FGM and 3 other fields High correlation
FGM is highly correlated with FGA and 2 other fields High correlation
FGA is highly correlated with FGM and 1 other field High correlation
FG_PCT is highly correlated with FGM High correlation
FG3M is highly correlated with FG3A and 1 other field High correlation
FG3A is highly correlated with FG3M and 1 other field High correlation
FG3_PCT is highly correlated with FG3M and 1 other field High correlation
FTM is highly correlated with FTA and 2 other fields High correlation
FTA is highly correlated with FTM and 2 other fields High correlation
FT_PCT is highly correlated with FTM and 1 other field High correlation
OREB is highly correlated with REB High correlation
DREB is highly correlated with REB High correlation
REB is highly correlated with OREB and 1 other field High correlation
PTS is highly correlated with FGM and 3 other fields High correlation
FGM is highly correlated with FGA and 1 other field High correlation
FGA is highly correlated with FGM and 1 other field High correlation
FG3M is highly correlated with FG3A and 1 other field High correlation
FG3A is highly correlated with FG3M High correlation
FG3_PCT is highly correlated with FG3M High correlation
FTM is highly correlated with FTA and 1 other field High correlation
FTA is highly correlated with FTM High correlation
FT_PCT is highly correlated with FTM High correlation
OREB is highly correlated with REB High correlation
DREB is highly correlated with REB High correlation
REB is highly correlated with OREB and 1 other field High correlation
PTS is highly correlated with FGM and 1 other field High correlation
FGA is highly correlated with PTS and 3 other fields High correlation
PTS is highly correlated with FGA and 6 other fields High correlation
REB is highly correlated with OREB and 1 other field High correlation
FT_PCT is highly correlated with PTS and 2 other fields High correlation
FG3M is highly correlated with PTS and 2 other fields High correlation
OREB is highly correlated with REB High correlation
FG3_PCT is highly correlated with FG3M and 1 other field High correlation
FG_PCT is highly correlated with FGA and 2 other fields High correlation
FGM is highly correlated with FGA and 2 other fields High correlation
FTM is highly correlated with PTS and 2 other fields High correlation
TEAM_CITY is highly correlated with TEAM_ABBREVIATION High correlation
TEAM_ABBREVIATION is highly correlated with TEAM_CITY High correlation
FG3A is highly correlated with FGA and 2 other fields High correlation
FTA is highly correlated with PTS and 2 other fields High correlation
DREB is highly correlated with REB High correlation
TEAM_ABBREVIATION is highly correlated with TEAM_CITY High correlation
TEAM_CITY is highly correlated with TEAM_ABBREVIATION High correlation
df_index has unique values Unique
FGM has 8341 (3.5%) zeros Zeros
FG_PCT has 8341 (3.5%) zeros Zeros
FG3M has 121679 (51.6%) zeros Zeros
FG3A has 74056 (31.4%) zeros Zeros
FG3_PCT has 121679 (51.6%) zeros Zeros
FTM has 70624 (30.0%) zeros Zeros
FTA has 62467 (26.5%) zeros Zeros
FT_PCT has 70624 (30.0%) zeros Zeros
OREB has 87416 (37.1%) zeros Zeros
DREB has 12295 (5.2%) zeros Zeros
REB has 7168 (3.0%) zeros Zeros
AST has 38786 (16.4%) zeros Zeros
STL has 97779 (41.5%) zeros Zeros
BLK has 143602 (60.9%) zeros Zeros
TO has 48402 (20.5%) zeros Zeros
PF has 20991 (8.9%) zeros Zeros
PTS has 5577 (2.4%) zeros Zeros
PLUS_MINUS has 7488 (3.2%) zeros Zeros
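
The zero counts above hint at how the percentage columns were built. FG3_PCT has exactly as many zeros as FG3M (121679), although FG3A is zero in only 74056 rows, and FT_PCT matches FTM (70624) while FTA has 62467 zeros. Together with the 0 missing cells reported in the overview, this suggests that a percentage is stored as 0 both when every attempt was missed and when there were no attempts at all. The sketch below is one way to keep those two cases apart when recomputing the ratio; `shooting_pct` is a hypothetical helper, not part of the dataset or of pandas-profiling, and the rounding to three decimals is an assumption based on values such as 0.333 and 0.667 in the report.

```python
from typing import Optional


def shooting_pct(made: int, attempted: int) -> Optional[float]:
    """Return made/attempted rounded to 3 decimals, or None when nothing was attempted."""
    if made < 0 or attempted < 0 or made > attempted:
        raise ValueError("expected 0 <= made <= attempted")
    if attempted == 0:
        return None  # undefined ratio, kept distinct from a genuine 0.0
    return round(made / attempted, 3)


print(shooting_pct(0, 0))  # no attempts
print(shooting_pct(0, 4))  # attempts, all missed
print(shooting_pct(2, 6))  # ordinary case
```

Whether to treat the zero-attempt rows as missing depends on the analysis; for averages of shooting accuracy, leaving them at 0 would pull the mean down.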

Reproduction

Analysis started: 2021-07-14 15:33:39.843016
Analysis finished: 2021-07-14 15:37:18.369965
Duration: 3 minutes and 38.53 seconds
Software version: pandas-profiling v3.0.0
Download configuration: config.json
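
The reported duration follows directly from the two timestamps. As a small check using only the Python standard library (the variable names are illustrative):

```python
from datetime import datetime

# Timestamps copied from the "Reproduction" block of this report.
started = datetime.fromisoformat("2021-07-14 15:33:39.843016")
finished = datetime.fromisoformat("2021-07-14 15:37:18.369965")

elapsed = (finished - started).total_seconds()
minutes, seconds = divmod(elapsed, 60)

print(f"{int(minutes)} minutes and {seconds:.2f} seconds")
```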

Variables

df_index
Real number (ℝ≥0)

UNIQUE

Distinct: 235795
Distinct (%): 100.0%
Missing: 0
Missing (%): 0.0%
Infinite: 0
Infinite (%): 0.0%
Mean: 304752.3991
Minimum: 0
Maximum: 612961
Zeros: 1
Zeros (%): < 0.1%
Negative: 0
Negative (%): 0.0%
Memory size: 1.8 MiB

Quantile statistics

Minimum: 0
5-th percentile: 31970.7
Q1: 151189.5
median: 302143
Q3: 456701.5
95-th percentile: 582896.7
Maximum: 612961
Range: 612961
Interquartile range (IQR): 305512

Descriptive statistics

Standard deviation: 176291.1272
Coefficient of variation (CV): 0.5784733038
Kurtosis: -1.193105763
Mean: 304752.3991
Median Absolute Deviation (MAD): 152692
Skewness: 0.01639119569
Sum: 7.185909195 × 10^10
Variance: 3.107856152 × 10^10
Monotonicity: Not monotonic
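
Several of the descriptive statistics are derived from others in the same block, so they can be cross-checked by hand. The sketch below recomputes the coefficient of variation (standard deviation divided by mean), the interquartile range (Q3 minus Q1), the range and the variance (standard deviation squared) from the df_index figures above. It uses the standard definitions of these quantities; the variable names are illustrative.

```python
# Values copied from the df_index statistics of this report.
mean = 304752.3991
std = 176291.1272
q1, q3 = 151189.5, 456701.5
minimum, maximum = 0, 612961

cv = std / mean                  # coefficient of variation
iqr = q3 - q1                    # interquartile range
value_range = maximum - minimum  # range
variance = std ** 2              # variance

print(f"CV       = {cv:.6f}")
print(f"IQR      = {iqr:g}")
print(f"Range    = {value_range}")
print(f"Variance = {variance:.4e}")
```

The recomputed values agree with the reported ones up to the rounding of the inputs, and the same relations apply to every numeric variable in the report.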
Histogram with fixed size bins (bins=50)
ValueCountFrequency (%)
5283821
 
< 0.1%
5927561
 
< 0.1%
746091
 
< 0.1%
1176121
 
< 0.1%
1237531
 
< 0.1%
1258001
 
< 0.1%
1032711
 
< 0.1%
1053181
 
< 0.1%
991731
 
< 0.1%
1135061
 
< 0.1%
Other values (235785)235785
> 99.9%
ValueCountFrequency (%)
01
< 0.1%
11
< 0.1%
21
< 0.1%
31
< 0.1%
41
< 0.1%
141
< 0.1%
151
< 0.1%
161
< 0.1%
171
< 0.1%
181
< 0.1%
ValueCountFrequency (%)
6129611
< 0.1%
6129601
< 0.1%
6129591
< 0.1%
6129581
< 0.1%
6129571
< 0.1%
6129491
< 0.1%
6129481
< 0.1%
6129471
< 0.1%
6129461
< 0.1%
6129451
< 0.1%

GAME_ID
Real number (ℝ≥0)

Distinct: 23565
Distinct (%): 10.0%
Missing: 0
Missing (%): 0.0%
Infinite: 0
Infinite (%): 0.0%
Mean: 22130233.66
Minimum: 11400001
Maximum: 52000211
Zeros: 0
Zeros (%): 0.0%
Negative: 0
Negative (%): 0.0%
Memory size: 1.8 MiB

Quantile statistics

Minimum: 11400001
5-th percentile: 20300594
Q1: 20700431
median: 21200416
Q3: 21700162
95-th percentile: 40600116.1
Maximum: 52000211
Range: 40600210
Interquartile range (IQR): 999731

Descriptive statistics

Standard deviation: 5104510.513
Coefficient of variation (CV): 0.2306577776
Kurtosis: 9.464575028
Mean: 22130233.66
Median Absolute Deviation (MAD): 499865
Skewness: 2.966457018
Sum: 5.218198445 × 10^12
Variance: 2.605602758 × 10^13
Monotonicity: Increasing
Histogram with fixed size bins (bins=50)
ValueCountFrequency (%)
2200001717
 
< 0.1%
2200002115
 
< 0.1%
2200007015
 
< 0.1%
2200000115
 
< 0.1%
2200000714
 
< 0.1%
2200005814
 
< 0.1%
2200004814
 
< 0.1%
2200002714
 
< 0.1%
2200000814
 
< 0.1%
2200000214
 
< 0.1%
Other values (23555)235649
99.9%
ValueCountFrequency (%)
1140000110
< 0.1%
1140000210
< 0.1%
1140000410
< 0.1%
1140000510
< 0.1%
1140000610
< 0.1%
1140000710
< 0.1%
1140000810
< 0.1%
1140000910
< 0.1%
1140001010
< 0.1%
1140001210
< 0.1%
ValueCountFrequency (%)
5200021110
< 0.1%
5200020110
< 0.1%
5200013110
< 0.1%
5200012110
< 0.1%
5200011110
< 0.1%
5200010110
< 0.1%
5190011110
< 0.1%
4200017210
< 0.1%
4200017110
< 0.1%
4200016210
< 0.1%

TEAM_ABBREVIATION
Categorical

HIGH CORRELATION
HIGH CORRELATION

Distinct: 34
Distinct (%): < 0.1%
Missing: 0
Missing (%): 0.0%
Memory size: 1.8 MiB
SAS: 8433
MIA: 8418
BOS: 8319
LAL: 8196
CLE: 8104
Other values (29): 194325

Length

Max length: 3
Median length: 3
Mean length: 3
Min length: 3

Characters and Unicode

Total characters: 707385
Distinct characters: 22
Distinct categories: 1
Distinct scripts: 1
Distinct blocks: 1
The Unicode Standard assigns character properties to each code point, which can be used to analyse textual variables.

Unique

Unique: 0
Unique (%): 0.0%

Sample

1st row: MIA
2nd row: MIA
3rd row: MIA
4th row: MIA
5th row: MIA

Common Values

ValueCountFrequency (%)
SAS8433
 
3.6%
MIA8418
 
3.6%
BOS8319
 
3.5%
LAL8196
 
3.5%
CLE8104
 
3.4%
GSW8055
 
3.4%
HOU8027
 
3.4%
DAL8019
 
3.4%
IND7974
 
3.4%
DET7931
 
3.4%
Other values (24)154319
65.4%

Length

Histogram of lengths of the category
ValueCountFrequency (%)
sas8433
 
3.6%
mia8418
 
3.6%
bos8319
 
3.5%
lal8196
 
3.5%
cle8104
 
3.4%
gsw8055
 
3.4%
hou8027
 
3.4%
dal8019
 
3.4%
ind7974
 
3.4%
det7931
 
3.4%
Other values (24)154319
65.4%

Most occurring characters

ValueCountFrequency (%)
A80966
 
11.4%
L63730
 
9.0%
O53321
 
7.5%
S50599
 
7.2%
N49985
 
7.1%
I47144
 
6.7%
C44232
 
6.3%
H41728
 
5.9%
M39232
 
5.5%
E33865
 
4.8%
Other values (12)202583
28.6%

Most occurring categories

ValueCountFrequency (%)
Uppercase Letter707385
100.0%

Most frequent character per category

Uppercase Letter
ValueCountFrequency (%)
A80966
 
11.4%
L63730
 
9.0%
O53321
 
7.5%
S50599
 
7.2%
N49985
 
7.1%
I47144
 
6.7%
C44232
 
6.3%
H41728
 
5.9%
M39232
 
5.5%
E33865
 
4.8%
Other values (12)202583
28.6%

Most occurring scripts

ValueCountFrequency (%)
Latin707385
100.0%

Most frequent character per script

Latin
ValueCountFrequency (%)
A80966
 
11.4%
L63730
 
9.0%
O53321
 
7.5%
S50599
 
7.2%
N49985
 
7.1%
I47144
 
6.7%
C44232
 
6.3%
H41728
 
5.9%
M39232
 
5.5%
E33865
 
4.8%
Other values (12)202583
28.6%

Most occurring blocks

ValueCountFrequency (%)
ASCII707385
100.0%

Most frequent character per block

ASCII
ValueCountFrequency (%)
A80966
 
11.4%
L63730
 
9.0%
O53321
 
7.5%
S50599
 
7.2%
N49985
 
7.1%
I47144
 
6.7%
C44232
 
6.3%
H41728
 
5.9%
M39232
 
5.5%
E33865
 
4.8%
Other values (12)202583
28.6%

TEAM_CITY
Categorical

HIGH CORRELATION
HIGH CORRELATION

Distinct: 33
Distinct (%): < 0.1%
Missing: 0
Missing (%): 0.0%
Memory size: 1.8 MiB
Los Angeles: 13356
San Antonio: 8433
Miami: 8418
Boston: 8319
Cleveland: 8104
Other values (28): 189165

Length

Max length: 25
Median length: 8
Mean length: 8.282652304
Min length: 2

Characters and Unicode

Total characters: 1953008
Distinct characters: 41
Distinct categories: 4
Distinct scripts: 2
Distinct blocks: 1
The Unicode Standard assigns character properties to each code point, which can be used to analyse textual variables.

Unique

Unique: 0
Unique (%): 0.0%

Sample

1st row: Miami
2nd row: Miami
3rd row: Miami
4th row: Miami
5th row: Miami

Common Values

ValueCountFrequency (%)
Los Angeles13356
 
5.7%
San Antonio8433
 
3.6%
Miami8418
 
3.6%
Boston8319
 
3.5%
Cleveland8104
 
3.4%
Golden State8055
 
3.4%
Houston8027
 
3.4%
Dallas8019
 
3.4%
Indiana7974
 
3.4%
Detroit7931
 
3.4%
Other values (23)149159
63.3%

Length

Histogram of lengths of the category
ValueCountFrequency (%)
new18901
 
6.5%
los13356
 
4.6%
angeles13356
 
4.6%
antonio8433
 
2.9%
san8433
 
2.9%
miami8418
 
2.9%
boston8319
 
2.9%
cleveland8104
 
2.8%
state8055
 
2.8%
golden8055
 
2.8%
Other values (28)187813
64.5%

Most occurring characters

ValueCountFrequency (%)
a195505
 
10.0%
o185560
 
9.5%
n183557
 
9.4%
e181870
 
9.3%
t143664
 
7.4%
l127678
 
6.5%
i109640
 
5.6%
s85500
 
4.4%
r76727
 
3.9%
h68126
 
3.5%
Other values (31)595181
30.5%

Most occurring categories

ValueCountFrequency (%)
Lowercase Letter1601998
82.0%
Uppercase Letter294742
 
15.1%
Space Separator55448
 
2.8%
Other Punctuation820
 
< 0.1%

Most frequent character per category

Lowercase Letter
ValueCountFrequency (%)
a195505
12.2%
o185560
11.6%
n183557
11.5%
e181870
11.4%
t143664
9.0%
l127678
8.0%
i109640
6.8%
s85500
 
5.3%
r76727
 
4.8%
h68126
 
4.3%
Other values (11)244171
15.2%
Uppercase Letter
ValueCountFrequency (%)
A32318
11.0%
M31421
10.7%
C29699
10.1%
S26106
8.9%
D23865
8.1%
P23261
 
7.9%
O22109
 
7.5%
N18901
 
6.4%
L16035
 
5.4%
B12256
 
4.2%
Other values (8)58771
19.9%
Space Separator
ValueCountFrequency (%)
55448
100.0%
Other Punctuation
ValueCountFrequency (%)
/820
100.0%

Most occurring scripts

ValueCountFrequency (%)
Latin1896740
97.1%
Common56268
 
2.9%

Most frequent character per script

Latin
ValueCountFrequency (%)
a195505
 
10.3%
o185560
 
9.8%
n183557
 
9.7%
e181870
 
9.6%
t143664
 
7.6%
l127678
 
6.7%
i109640
 
5.8%
s85500
 
4.5%
r76727
 
4.0%
h68126
 
3.6%
Other values (29)538913
28.4%
Common
ValueCountFrequency (%)
55448
98.5%
/820
 
1.5%

Most occurring blocks

ValueCountFrequency (%)
ASCII1953008
100.0%

Most frequent character per block

ASCII
ValueCountFrequency (%)
a195505
 
10.0%
o185560
 
9.5%
n183557
 
9.4%
e181870
 
9.3%
t143664
 
7.4%
l127678
 
6.5%
i109640
 
5.6%
s85500
 
4.4%
r76727
 
3.9%
h68126
 
3.5%
Other values (31)595181
30.5%

PLAYER_NAME
Categorical

HIGH CARDINALITY

Distinct: 1433
Distinct (%): 0.6%
Missing: 0
Missing (%): 0.0%
Memory size: 1.8 MiB
LeBron James: 1596
Dirk Nowitzki: 1242
Chris Paul: 1231
Carmelo Anthony: 1219
Tony Parker: 1188
Other values (1428): 229319

Length

Max length: 24
Median length: 13
Mean length: 12.76854895
Min length: 4

Characters and Unicode

Total characters: 3010760
Distinct characters: 56
Distinct categories: 5
Distinct scripts: 2
Distinct blocks: 1
The Unicode Standard assigns character properties to each code point, which can be used to analyse textual variables.

Unique

Unique: 95
Unique (%): < 0.1%

Sample

1st row: Luol Deng
2nd row: Udonis Haslem
3rd row: Chris Bosh
4th row: Dwyane Wade
5th row: Mario Chalmers

Common Values

ValueCountFrequency (%)
LeBron James1596
 
0.7%
Dirk Nowitzki1242
 
0.5%
Chris Paul1231
 
0.5%
Carmelo Anthony1219
 
0.5%
Tony Parker1188
 
0.5%
Dwight Howard1170
 
0.5%
Pau Gasol1135
 
0.5%
Tim Duncan1125
 
0.5%
Joe Johnson1105
 
0.5%
Dwyane Wade1105
 
0.5%
Other values (1423)223679
94.9%

Length

Histogram of lengths of the category
ValueCountFrequency (%)
chris4067
 
0.9%
paul3958
 
0.8%
jason3723
 
0.8%
james3708
 
0.8%
kevin3630
 
0.8%
williams3620
 
0.8%
anthony3129
 
0.7%
mike3015
 
0.6%
johnson2915
 
0.6%
green2722
 
0.6%
Other values (1768)441160
92.7%

Most occurring characters

ValueCountFrequency (%)
e257515
 
8.6%
a247165
 
8.2%
239852
 
8.0%
n219313
 
7.3%
r217296
 
7.2%
o203480
 
6.8%
i169290
 
5.6%
l144788
 
4.8%
s129496
 
4.3%
t80506
 
2.7%
Other values (46)1102059
36.6%

Most occurring categories

ValueCountFrequency (%)
Lowercase Letter2257841
75.0%
Uppercase Letter499110
 
16.6%
Space Separator239852
 
8.0%
Other Punctuation10672
 
0.4%
Dash Punctuation3285
 
0.1%

Most frequent character per category

Uppercase Letter
ValueCountFrequency (%)
J50965
 
10.2%
M42260
 
8.5%
D39682
 
8.0%
B39072
 
7.8%
A31931
 
6.4%
C29828
 
6.0%
R29562
 
5.9%
T26045
 
5.2%
S25295
 
5.1%
G24552
 
4.9%
Other values (16)159918
32.0%
Lowercase Letter
ValueCountFrequency (%)
e257515
11.4%
a247165
10.9%
n219313
9.7%
r217296
9.6%
o203480
 
9.0%
i169290
 
7.5%
l144788
 
6.4%
s129496
 
5.7%
t80506
 
3.6%
d78221
 
3.5%
Other values (16)510771
22.6%
Other Punctuation
ValueCountFrequency (%)
.7530
70.6%
'3142
29.4%
Space Separator
ValueCountFrequency (%)
239852
100.0%
Dash Punctuation
ValueCountFrequency (%)
-3285
100.0%

Most occurring scripts

ValueCountFrequency (%)
Latin2756951
91.6%
Common253809
 
8.4%

Most frequent character per script

Latin
ValueCountFrequency (%)
e257515
 
9.3%
a247165
 
9.0%
n219313
 
8.0%
r217296
 
7.9%
o203480
 
7.4%
i169290
 
6.1%
l144788
 
5.3%
s129496
 
4.7%
t80506
 
2.9%
d78221
 
2.8%
Other values (42)1009881
36.6%
Common
ValueCountFrequency (%)
239852
94.5%
.7530
 
3.0%
-3285
 
1.3%
'3142
 
1.2%

Most occurring blocks

ValueCountFrequency (%)
ASCII3010760
100.0%

Most frequent character per block

ASCII
ValueCountFrequency (%)
e257515
 
8.6%
a247165
 
8.2%
239852
 
8.0%
n219313
 
7.3%
r217296
 
7.2%
o203480
 
6.8%
i169290
 
5.6%
l144788
 
4.8%
s129496
 
4.3%
t80506
 
2.7%
Other values (46)1102059
36.6%

START_POSITION
Categorical

Distinct: 3
Distinct (%): < 0.1%
Missing: 0
Missing (%): 0.0%
Memory size: 1.8 MiB
G: 94330
F: 94322
C: 47143

Length

Max length: 1
Median length: 1
Mean length: 1
Min length: 1

Characters and Unicode

Total characters: 235795
Distinct characters: 3
Distinct categories: 1
Distinct scripts: 1
Distinct blocks: 1
The Unicode Standard assigns character properties to each code point, which can be used to analyse textual variables.

Unique

Unique: 0
Unique (%): 0.0%

Sample

1st row: F
2nd row: F
3rd row: C
4th row: G
5th row: G

Common Values

ValueCountFrequency (%)
G94330
40.0%
F94322
40.0%
C47143
20.0%

Length

Histogram of lengths of the category

Pie chart

ValueCountFrequency (%)
g94330
40.0%
f94322
40.0%
c47143
20.0%

Most occurring characters

ValueCountFrequency (%)
G94330
40.0%
F94322
40.0%
C47143
20.0%

Most occurring categories

ValueCountFrequency (%)
Uppercase Letter235795
100.0%

Most frequent character per category

Uppercase Letter
ValueCountFrequency (%)
G94330
40.0%
F94322
40.0%
C47143
20.0%

Most occurring scripts

ValueCountFrequency (%)
Latin235795
100.0%

Most frequent character per script

Latin
ValueCountFrequency (%)
G94330
40.0%
F94322
40.0%
C47143
20.0%

Most occurring blocks

ValueCountFrequency (%)
ASCII235795
100.0%

Most frequent character per block

ASCII
ValueCountFrequency (%)
G94330
40.0%
F94322
40.0%
C47143
20.0%

MIN
Categorical

HIGH CARDINALITY

Distinct: 3200
Distinct (%): 1.4%
Missing: 0
Missing (%): 0.0%
Memory size: 1.8 MiB
24:00: 371
33:55: 239
35:48: 238
34:25: 234
35:36: 233
Other values (3195): 234480

Length

Max length: 5
Median length: 5
Mean length: 4.990847982
Min length: 4

Characters and Unicode

Total characters: 1176817
Distinct characters: 11
Distinct categories: 2
Distinct scripts: 1
Distinct blocks: 1
The Unicode Standard assigns character properties to each code point, which can be used to analyse textual variables.

Unique

Unique: 242
Unique (%): 0.1%

Sample

1st row: 24:50
2nd row: 10:32
3rd row: 25:20
4th row: 20:31
5th row: 21:16
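
MIN is profiled as a categorical variable with high cardinality because playing time is stored as text such as 24:50. Converting it to a number would let it be summarised like the other box-score columns. The sketch below is one possible conversion, assuming every value has the form M:SS or MM:SS (the length and character statistics above are consistent with that, but the report does not guarantee it); `minutes_played` is a hypothetical helper name.

```python
def minutes_played(clock: str) -> float:
    """Convert an 'MM:SS' string such as '24:50' into fractional minutes."""
    minutes_text, seconds_text = clock.split(":")
    minutes, seconds = int(minutes_text), int(seconds_text)
    if minutes < 0 or not 0 <= seconds < 60:
        raise ValueError(f"not a valid MM:SS value: {clock!r}")
    return minutes + seconds / 60


# The five sample rows shown in this report.
sample = ["24:50", "10:32", "25:20", "20:31", "21:16"]
print([round(minutes_played(value), 2) for value in sample])
```

Values that do not match the assumed pattern raise an error instead of being silently coerced, which makes unexpected formats visible early.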

Common Values

ValueCountFrequency (%)
24:00371
 
0.2%
33:55239
 
0.1%
35:48238
 
0.1%
34:25234
 
0.1%
35:36233
 
0.1%
36:09233
 
0.1%
33:01233
 
0.1%
35:34232
 
0.1%
35:16232
 
0.1%
34:28231
 
0.1%
Other values (3190)233319
98.9%

Length

Histogram of lengths of the category
ValueCountFrequency (%)
24:00371
 
0.2%
33:55239
 
0.1%
35:48238
 
0.1%
34:25234
 
0.1%
35:36233
 
0.1%
36:09233
 
0.1%
33:01233
 
0.1%
35:34232
 
0.1%
35:16232
 
0.1%
34:28231
 
0.1%
Other values (3190)233319
98.9%

Most occurring characters

ValueCountFrequency (%)
:235795
20.0%
3197315
16.8%
2159898
13.6%
4114162
9.7%
1106472
9.0%
587503
 
7.4%
086498
 
7.4%
648047
 
4.1%
847142
 
4.0%
747021
 
4.0%

Most occurring categories

ValueCountFrequency (%)
Decimal Number941022
80.0%
Other Punctuation235795
 
20.0%

Most frequent character per category

Decimal Number
ValueCountFrequency (%)
3197315
21.0%
2159898
17.0%
4114162
12.1%
1106472
11.3%
587503
9.3%
086498
9.2%
648047
 
5.1%
847142
 
5.0%
747021
 
5.0%
946964
 
5.0%
Other Punctuation
ValueCountFrequency (%)
:235795
100.0%

Most occurring scripts

ValueCountFrequency (%)
Common1176817
100.0%

Most frequent character per script

Common
ValueCountFrequency (%)
:235795
20.0%
3197315
16.8%
2159898
13.6%
4114162
9.7%
1106472
9.0%
587503
 
7.4%
086498
 
7.4%
648047
 
4.1%
847142
 
4.0%
747021
 
4.0%

Most occurring blocks

ValueCountFrequency (%)
ASCII1176817
100.0%

Most frequent character per block

ASCII
ValueCountFrequency (%)
:235795
20.0%
3197315
16.8%
2159898
13.6%
4114162
9.7%
1106472
9.0%
587503
 
7.4%
086498
 
7.4%
648047
 
4.1%
847142
 
4.0%
747021
 
4.0%

FGM
Real number (ℝ≥0)

HIGH CORRELATION
HIGH CORRELATION
HIGH CORRELATION
HIGH CORRELATION
ZEROS

Distinct: 26
Distinct (%): < 0.1%
Missing: 0
Missing (%): 0.0%
Infinite: 0
Infinite (%): 0.0%
Mean: 5.155401938
Minimum: 0
Maximum: 28
Zeros: 8341
Zeros (%): 3.5%
Negative: 0
Negative (%): 0.0%
Memory size: 1.8 MiB

Quantile statistics

Minimum: 0
5-th percentile: 1
Q1: 3
median: 5
Q3: 7
95-th percentile: 11
Maximum: 28
Range: 28
Interquartile range (IQR): 4

Descriptive statistics

Standard deviation: 3.133111021
Coefficient of variation (CV): 0.6077336081
Kurtosis: 0.3817920148
Mean: 5.155401938
Median Absolute Deviation (MAD): 2
Skewness: 0.6703263532
Sum: 1215618
Variance: 9.81638467
Monotonicity: Not monotonic
Histogram with fixed size bins (bins=26)
ValueCountFrequency (%)
430117
12.8%
328889
12.3%
528833
12.2%
625264
10.7%
224678
10.5%
720928
8.9%
117614
7.5%
816577
7.0%
912041
 
5.1%
108490
 
3.6%
Other values (16)22364
9.5%
ValueCountFrequency (%)
08341
 
3.5%
117614
7.5%
224678
10.5%
328889
12.3%
430117
12.8%
528833
12.2%
625264
10.7%
720928
8.9%
816577
7.0%
912041
 
5.1%
ValueCountFrequency (%)
281
 
< 0.1%
241
 
< 0.1%
233
 
< 0.1%
225
 
< 0.1%
2113
 
< 0.1%
2029
 
< 0.1%
1964
 
< 0.1%
1885
 
< 0.1%
17215
0.1%
16383
0.2%

FGA
Real number (ℝ≥0)

HIGH CORRELATION
HIGH CORRELATION
HIGH CORRELATION
HIGH CORRELATION

Distinct: 47
Distinct (%): < 0.1%
Missing: 0
Missing (%): 0.0%
Infinite: 0
Infinite (%): 0.0%
Mean: 11.16402383
Minimum: 0
Maximum: 50
Zeros: 970
Zeros (%): 0.4%
Negative: 0
Negative (%): 0.0%
Memory size: 1.8 MiB

Quantile statistics

Minimum: 0
5-th percentile: 3
Q1: 7
median: 11
Q3: 15
95-th percentile: 21
Maximum: 50
Range: 50
Interquartile range (IQR): 8

Descriptive statistics

Standard deviation: 5.61073546
Coefficient of variation (CV): 0.5025728665
Kurtosis: 0.2799226118
Mean: 11.16402383
Median Absolute Deviation (MAD): 4
Skewness: 0.5759042881
Sum: 2632421
Variance: 31.4803524
Monotonicity: Not monotonic
Histogram with fixed size bins (bins=47)
ValueCountFrequency (%)
916866
 
7.2%
1016471
 
7.0%
1116248
 
6.9%
816233
 
6.9%
715403
 
6.5%
1215113
 
6.4%
1314007
 
5.9%
613927
 
5.9%
1412564
 
5.3%
512045
 
5.1%
Other values (37)86918
36.9%
ValueCountFrequency (%)
0970
 
0.4%
12617
 
1.1%
24790
 
2.0%
37180
3.0%
49801
4.2%
512045
5.1%
613927
5.9%
715403
6.5%
816233
6.9%
916866
7.2%
ValueCountFrequency (%)
501
 
< 0.1%
461
 
< 0.1%
451
 
< 0.1%
443
 
< 0.1%
433
 
< 0.1%
413
 
< 0.1%
403
 
< 0.1%
399
< 0.1%
388
 
< 0.1%
3722
< 0.1%

FG_PCT
Real number (ℝ≥0)

HIGH CORRELATION
HIGH CORRELATION
ZEROS

Distinct: 311
Distinct (%): 0.1%
Missing: 0
Missing (%): 0.0%
Infinite: 0
Infinite (%): 0.0%
Mean: 0.4558338557
Minimum: 0
Maximum: 1
Zeros: 8341
Zeros (%): 3.5%
Negative: 0
Negative (%): 0.0%
Memory size: 1.8 MiB

Quantile statistics

Minimum: 0
5-th percentile: 0.143
Q1: 0.333
median: 0.455
Q3: 0.571
95-th percentile: 0.75
Maximum: 1
Range: 1
Interquartile range (IQR): 0.238

Descriptive statistics

Standard deviation: 0.1867048761
Coefficient of variation (CV): 0.4095897523
Kurtosis: 0.6905640973
Mean: 0.4558338557
Median Absolute Deviation (MAD): 0.116
Skewness: 0.04496138142
Sum: 107483.344
Variance: 0.03485871074
Monotonicity: Not monotonic
Histogram with fixed size bins (bins=50)
ValueCountFrequency (%)
0.527853
 
11.8%
0.410539
 
4.5%
0.3339853
 
4.2%
0.6679428
 
4.0%
08341
 
3.5%
0.67672
 
3.3%
0.4297656
 
3.2%
0.257387
 
3.1%
0.3755796
 
2.5%
0.5715756
 
2.4%
Other values (301)135514
57.5%
ValueCountFrequency (%)
08341
3.5%
0.0563
 
< 0.1%
0.0591
 
< 0.1%
0.0591
 
< 0.1%
0.0637
 
< 0.1%
0.06711
 
< 0.1%
0.07119
 
< 0.1%
0.07752
 
< 0.1%
0.08384
 
< 0.1%
0.091175
 
0.1%
ValueCountFrequency (%)
13941
1.7%
0.9381
 
< 0.1%
0.9331
 
< 0.1%
0.9332
 
< 0.1%
0.9299
 
< 0.1%
0.9234
 
< 0.1%
0.92311
 
< 0.1%
0.91721
 
< 0.1%
0.90968
 
< 0.1%
0.9121
 
0.1%

FG3M
Real number (ℝ≥0)

HIGH CORRELATION
HIGH CORRELATION
HIGH CORRELATION
HIGH CORRELATION
ZEROS

Distinct: 15
Distinct (%): < 0.1%
Missing: 0
Missing (%): 0.0%
Infinite: 0
Infinite (%): 0.0%
Mean: 1.010284357
Minimum: 0
Maximum: 14
Zeros: 121679
Zeros (%): 51.6%
Negative: 0
Negative (%): 0.0%
Memory size: 1.8 MiB

Quantile statistics

Minimum: 0
5-th percentile: 0
Q1: 0
median: 0
Q3: 2
95-th percentile: 4
Maximum: 14
Range: 14
Interquartile range (IQR): 2

Descriptive statistics

Standard deviation: 1.383695112
Coefficient of variation (CV): 1.369609558
Kurtosis: 3.173297286
Mean: 1.010284357
Median Absolute Deviation (MAD): 0
Skewness: 1.661790811
Sum: 238220
Variance: 1.914612163
Monotonicity: Not monotonic
Histogram with fixed size bins (bins=15)
ValueCountFrequency (%)
0121679
51.6%
149218
20.9%
231774
 
13.5%
317801
 
7.5%
48807
 
3.7%
53870
 
1.6%
61648
 
0.7%
7627
 
0.3%
8233
 
0.1%
979
 
< 0.1%
Other values (5)59
 
< 0.1%
ValueCountFrequency (%)
0121679
51.6%
149218
20.9%
231774
 
13.5%
317801
 
7.5%
48807
 
3.7%
53870
 
1.6%
61648
 
0.7%
7627
 
0.3%
8233
 
0.1%
979
 
< 0.1%
ValueCountFrequency (%)
141
 
< 0.1%
132
 
< 0.1%
121
 
< 0.1%
1119
 
< 0.1%
1036
 
< 0.1%
979
 
< 0.1%
8233
 
0.1%
7627
 
0.3%
61648
0.7%
53870
1.6%

FG3A
Real number (ℝ≥0)

HIGH CORRELATION
HIGH CORRELATION
HIGH CORRELATION
HIGH CORRELATION
ZEROS

Distinct: 25
Distinct (%): < 0.1%
Missing: 0
Missing (%): 0.0%
Infinite: 0
Infinite (%): 0.0%
Mean: 2.789054051
Minimum: 0
Maximum: 24
Zeros: 74056
Zeros (%): 31.4%
Negative: 0
Negative (%): 0.0%
Memory size: 1.8 MiB

Quantile statistics

Minimum: 0
5-th percentile: 0
Q1: 0
median: 2
Q3: 5
95-th percentile: 8
Maximum: 24
Range: 24
Interquartile range (IQR): 5

Descriptive statistics

Standard deviation: 2.857612294
Coefficient of variation (CV): 1.024581181
Kurtosis: 1.010592461
Mean: 2.789054051
Median Absolute Deviation (MAD): 2
Skewness: 1.068899273
Sum: 657645
Variance: 8.165948025
Monotonicity: Not monotonic
Histogram with fixed size bins (bins=25)
ValueCountFrequency (%)
074056
31.4%
226479
 
11.2%
126380
 
11.2%
325989
 
11.0%
423075
 
9.8%
518811
 
8.0%
614210
 
6.0%
79851
 
4.2%
86694
 
2.8%
94221
 
1.8%
Other values (15)6029
 
2.6%
ValueCountFrequency (%)
074056
31.4%
126380
 
11.2%
226479
 
11.2%
325989
 
11.0%
423075
 
9.8%
518811
 
8.0%
614210
 
6.0%
79851
 
4.2%
86694
 
2.8%
94221
 
1.8%
ValueCountFrequency (%)
241
 
< 0.1%
232
 
< 0.1%
225
 
< 0.1%
215
 
< 0.1%
208
 
< 0.1%
1915
 
< 0.1%
1823
 
< 0.1%
1742
 
< 0.1%
1685
< 0.1%
15158
0.1%

FG3_PCT
Real number (ℝ≥0)

HIGH CORRELATION
HIGH CORRELATION
HIGH CORRELATION
HIGH CORRELATION
ZEROS

Distinct: 99
Distinct (%): < 0.1%
Missing: 0
Missing (%): 0.0%
Infinite: 0
Infinite (%): 0.0%
Mean: 0.2289670476
Minimum: 0
Maximum: 1
Zeros: 121679
Zeros (%): 51.6%
Negative: 0
Negative (%): 0.0%
Memory size: 1.8 MiB

Quantile statistics

Minimum: 0
5-th percentile: 0
Q1: 0
median: 0
Q3: 0.429
95-th percentile: 0.8
Maximum: 1
Range: 1
Interquartile range (IQR): 0.429

Descriptive statistics

Standard deviation: 0.2841844932
Coefficient of variation (CV): 1.241158919
Kurtosis: 0.2512145088
Mean: 0.2289670476
Median Absolute Deviation (MAD): 0
Skewness: 1.053621362
Sum: 53989.285
Variance: 0.0807608262
Monotonicity: Not monotonic
Histogram with fixed size bins (bins=50)
ValueCountFrequency (%)
0121679
51.6%
0.524470
 
10.4%
0.33312304
 
5.2%
0.2510563
 
4.5%
110325
 
4.4%
0.6677564
 
3.2%
0.47155
 
3.0%
0.25972
 
2.5%
0.3335353
 
2.3%
0.63953
 
1.7%
Other values (89)26457
 
11.2%
ValueCountFrequency (%)
0121679
51.6%
0.0592
 
< 0.1%
0.0771
 
< 0.1%
0.08311
 
< 0.1%
0.09132
 
< 0.1%
0.177
 
< 0.1%
0.111203
 
0.1%
0.125609
 
0.3%
0.1333
 
< 0.1%
0.1431362
 
0.6%
ValueCountFrequency (%)
110325
4.4%
0.9092
 
< 0.1%
0.88915
 
< 0.1%
0.87537
 
< 0.1%
0.857143
 
0.1%
0.8461
 
< 0.1%
0.833390
 
0.2%
0.8185
 
< 0.1%
0.81039
 
0.4%
0.7861
 
< 0.1%

FTM
Real number (ℝ≥0)

HIGH CORRELATION
HIGH CORRELATION
HIGH CORRELATION
HIGH CORRELATION
ZEROS

Distinct: 27
Distinct (%): < 0.1%
Missing: 0
Missing (%): 0.0%
Infinite: 0
Infinite (%): 0.0%
Mean: 2.549706313
Minimum: 0
Maximum: 26
Zeros: 70624
Zeros (%): 30.0%
Negative: 0
Negative (%): 0.0%
Memory size: 1.8 MiB

Quantile statistics

Minimum: 0
5-th percentile: 0
Q1: 0
median: 2
Q3: 4
95-th percentile: 8
Maximum: 26
Range: 26
Interquartile range (IQR): 4

Descriptive statistics

Standard deviation: 2.773658603
Coefficient of variation (CV): 1.087834544
Kurtosis: 3.146026472
Mean: 2.549706313
Median Absolute Deviation (MAD): 2
Skewness: 1.5448468
Sum: 601208
Variance: 7.693182045
Monotonicity: Not monotonic
Histogram with fixed size bins (bins=27)
ValueCountFrequency (%)
070624
30.0%
241314
17.5%
131766
13.5%
324341
 
10.3%
421628
 
9.2%
513847
 
5.9%
610524
 
4.5%
77015
 
3.0%
84993
 
2.1%
93227
 
1.4%
Other values (17)6516
 
2.8%
ValueCountFrequency (%)
070624
30.0%
131766
13.5%
241314
17.5%
324341
 
10.3%
421628
 
9.2%
513847
 
5.9%
610524
 
4.5%
77015
 
3.0%
84993
 
2.1%
93227
 
1.4%
ValueCountFrequency (%)
261
 
< 0.1%
251
 
< 0.1%
247
 
< 0.1%
235
 
< 0.1%
228
 
< 0.1%
2118
 
< 0.1%
2015
 
< 0.1%
1931
 
< 0.1%
1865
< 0.1%
17109
< 0.1%

FTA
Real number (ℝ≥0)

HIGH CORRELATION
HIGH CORRELATION
HIGH CORRELATION
HIGH CORRELATION
ZEROS

Distinct: 33
Distinct (%): < 0.1%
Missing: 0
Missing (%): 0.0%
Infinite: 0
Infinite (%): 0.0%
Mean: 3.319595411
Minimum: 0
Maximum: 39
Zeros: 62467
Zeros (%): 26.5%
Negative: 0
Negative (%): 0.0%
Memory size: 1.8 MiB

Quantile statistics

Minimum: 0
5-th percentile: 0
Q1: 0
median: 2
Q3: 5
95-th percentile: 10
Maximum: 39
Range: 39
Interquartile range (IQR): 5

Descriptive statistics

Standard deviation: 3.364382444
Coefficient of variation (CV): 1.013491714
Kurtosis: 2.799386122
Mean: 3.319595411
Median Absolute Deviation (MAD): 2
Skewness: 1.426601067
Sum: 782744
Variance: 11.31906923
Monotonicity: Not monotonic
Histogram with fixed size bins (bins=33)
ValueCountFrequency (%)
062467
26.5%
248930
20.8%
429851
12.7%
616725
 
7.1%
315940
 
6.8%
113517
 
5.7%
512724
 
5.4%
88924
 
3.8%
78159
 
3.5%
94724
 
2.0%
Other values (23)13834
 
5.9%
ValueCountFrequency (%)
062467
26.5%
113517
 
5.7%
248930
20.8%
315940
 
6.8%
429851
12.7%
512724
 
5.4%
616725
 
7.1%
78159
 
3.5%
88924
 
3.8%
94724
 
2.0%
ValueCountFrequency (%)
392
 
< 0.1%
361
 
< 0.1%
342
 
< 0.1%
291
 
< 0.1%
283
 
< 0.1%
276
 
< 0.1%
267
 
< 0.1%
2512
 
< 0.1%
2431
< 0.1%
2325
< 0.1%

FT_PCT
Real number (ℝ≥0)

HIGH CORRELATION
HIGH CORRELATION
HIGH CORRELATION
HIGH CORRELATION
ZEROS

Distinct: 162
Distinct (%): 0.1%
Missing: 0
Missing (%): 0.0%
Infinite: 0
Infinite (%): 0.0%
Mean: 0.5575686889
Minimum: 0
Maximum: 1
Zeros: 70624
Zeros (%): 30.0%
Negative: 0
Negative (%): 0.0%
Memory size: 1.8 MiB

Quantile statistics

Minimum: 0
5-th percentile: 0
Q1: 0
median: 0.667
Q3: 1
95-th percentile: 1
Maximum: 1
Range: 1
Interquartile range (IQR): 1

Descriptive statistics

Standard deviation: 0.406276929
Coefficient of variation (CV): 0.7286580777
Kurtosis: -1.486007842
Mean: 0.5575686889
Median Absolute Deviation (MAD): 0.333
Skewness: -0.356002991
Sum: 131471.909
Variance: 0.1650609431
Monotonicity: Not monotonic
Histogram with fixed size bins (bins=50)
ValueCountFrequency (%)
070624
30.0%
169324
29.4%
0.526165
 
11.1%
0.7514212
 
6.0%
0.66711250
 
4.8%
0.8336246
 
2.6%
0.86023
 
2.6%
0.63213
 
1.4%
0.8572845
 
1.2%
0.8752723
 
1.2%
Other values (152)23170
 
9.8%
ValueCountFrequency (%)
070624
30.0%
0.0912
 
< 0.1%
0.15
 
< 0.1%
0.1116
 
< 0.1%
0.12521
 
< 0.1%
0.14337
 
< 0.1%
0.1542
 
< 0.1%
0.167165
 
0.1%
0.1825
 
< 0.1%
0.1881
 
< 0.1%
ValueCountFrequency (%)
169324
29.4%
0.9631
 
< 0.1%
0.961
 
< 0.1%
0.9582
 
< 0.1%
0.9571
 
< 0.1%
0.9554
 
< 0.1%
0.9523
 
< 0.1%
0.955
 
< 0.1%
0.9476
 
< 0.1%
0.9474
 
< 0.1%

OREB
Real number (ℝ≥0)

HIGH CORRELATION
HIGH CORRELATION
HIGH CORRELATION
HIGH CORRELATION
ZEROS

Distinct: 18
Distinct (%): < 0.1%
Missing: 0
Missing (%): 0.0%
Infinite: 0
Infinite (%): 0.0%
Mean: 1.382849509
Minimum: 0
Maximum: 18
Zeros: 87416
Zeros (%): 37.1%
Negative: 0
Negative (%): 0.0%
Memory size: 1.8 MiB

Quantile statistics

Minimum: 0
5-th percentile: 0
Q1: 0
median: 1
Q3: 2
95-th percentile: 5
Maximum: 18
Range: 18
Interquartile range (IQR): 2

Descriptive statistics

Standard deviation: 1.606538972
Coefficient of variation (CV): 1.161759802
Kurtosis: 3.444577172
Mean: 1.382849509
Median Absolute Deviation (MAD): 1
Skewness: 1.628471399
Sum: 326069
Variance: 2.580967469
Monotonicity: Not monotonic
Histogram with fixed size bins (bins=18)
ValueCountFrequency (%)
087416
37.1%
164950
27.5%
237819
16.0%
321300
 
9.0%
411817
 
5.0%
56218
 
2.6%
63209
 
1.4%
71641
 
0.7%
8788
 
0.3%
9378
 
0.2%
Other values (8)259
 
0.1%
ValueCountFrequency (%)
087416
37.1%
164950
27.5%
237819
16.0%
321300
 
9.0%
411817
 
5.0%
56218
 
2.6%
63209
 
1.4%
71641
 
0.7%
8788
 
0.3%
9378
 
0.2%
ValueCountFrequency (%)
181
 
< 0.1%
161
 
< 0.1%
151
 
< 0.1%
141
 
< 0.1%
1311
 
< 0.1%
1230
 
< 0.1%
1170
 
< 0.1%
10144
 
0.1%
9378
0.2%
8788
0.3%

DREB
Real number (ℝ≥0)

HIGH CORRELATION
HIGH CORRELATION
HIGH CORRELATION
HIGH CORRELATION
ZEROS

Distinct: 25
Distinct (%): < 0.1%
Missing: 0
Missing (%): 0.0%
Infinite: 0
Infinite (%): 0.0%
Mean: 4.215734006
Minimum: 0
Maximum: 25
Zeros: 12295
Zeros (%): 5.2%
Negative: 0
Negative (%): 0.0%
Memory size: 1.8 MiB

Quantile statistics

Minimum: 0
5-th percentile: 0
Q1: 2
median: 4
Q3: 6
95-th percentile: 10
Maximum: 25
Range: 25
Interquartile range (IQR): 4

Descriptive statistics

Standard deviation: 2.918633606
Coefficient of variation (CV): 0.6923192027
Kurtosis: 1.297800744
Mean: 4.215734006
Median Absolute Deviation (MAD): 2
Skewness: 1.019662492
Sum: 994049
Variance: 8.518422126
Monotonicity: Not monotonic
Histogram with fixed size bins (bins=25)
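| Value | Count | Frequency (%) |
| 3 | 36385 | 15.4% |
| 2 | 35736 | 15.2% |
| 4 | 32565 | 13.8% |
| 1 | 27148 | 11.5% |
| 5 | 26232 | 11.1% |
| 6 | 19632 | 8.3% |
| 7 | 14473 | 6.1% |
| 0 | 12295 | 5.2% |
| 8 | 10423 | 4.4% |
| 9 | 7393 | 3.1% |
| Other values (15) | 13513 | 5.7% |

| Value | Count | Frequency (%) |
| 0 | 12295 | 5.2% |
| 1 | 27148 | 11.5% |
| 2 | 35736 | 15.2% |
| 3 | 36385 | 15.4% |
| 4 | 32565 | 13.8% |
| 5 | 26232 | 11.1% |
| 6 | 19632 | 8.3% |
| 7 | 14473 | 6.1% |
| 8 | 10423 | 4.4% |
| 9 | 7393 | 3.1% |

| Value | Count | Frequency (%) |
| 25 | 1 | < 0.1% |
| 23 | 5 | < 0.1% |
| 22 | 4 | < 0.1% |
| 21 | 6 | < 0.1% |
| 20 | 24 | < 0.1% |
| 19 | 51 | < 0.1% |
| 18 | 97 | < 0.1% |
| 17 | 142 | 0.1% |
| 16 | 260 | 0.1% |
| 15 | 443 | 0.2% |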

REB
Real number (ℝ≥0)

HIGH CORRELATION
ZEROS

Distinct: 32
Distinct (%): < 0.1%
Missing: 0
Missing (%): 0.0%
Infinite: 0
Infinite (%): 0.0%
Mean: 5.598583515
Minimum: 0
Maximum: 31
Zeros: 7168
Zeros (%): 3.0%
Negative: 0
Negative (%): 0.0%
Memory size: 1.8 MiB

Quantile statistics

Minimum: 0
5-th percentile: 1
Q1: 3
median: 5
Q3: 8
95-th percentile: 13
Maximum: 31
Range: 31
Interquartile range (IQR): 5

Descriptive statistics

Standard deviation: 3.781479646
Coefficient of variation (CV): 0.6754350696
Kurtosis: 1.318035379
Mean: 5.598583515
Median Absolute Deviation (MAD): 2
Skewness: 1.053289185
Sum: 1320118
Variance: 14.29958831
Monotonicity: Not monotonic
Histogram with fixed size bins (bins=32)
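| Value | Count | Frequency (%) |
| 3 | 29173 | 12.4% |
| 4 | 29092 | 12.3% |
| 5 | 26343 | 11.2% |
| 2 | 25459 | 10.8% |
| 6 | 21997 | 9.3% |
| 7 | 18179 | 7.7% |
| 1 | 17459 | 7.4% |
| 8 | 14241 | 6.0% |
| 9 | 11026 | 4.7% |
| 10 | 9607 | 4.1% |
| Other values (22) | 33219 | 14.1% |

| Value | Count | Frequency (%) |
| 0 | 7168 | 3.0% |
| 1 | 17459 | 7.4% |
| 2 | 25459 | 10.8% |
| 3 | 29173 | 12.4% |
| 4 | 29092 | 12.3% |
| 5 | 26343 | 11.2% |
| 6 | 21997 | 9.3% |
| 7 | 18179 | 7.7% |
| 8 | 14241 | 6.0% |
| 9 | 11026 | 4.7% |

| Value | Count | Frequency (%) |
| 31 | 1 | < 0.1% |
| 30 | 3 | < 0.1% |
| 29 | 3 | < 0.1% |
| 28 | 2 | < 0.1% |
| 27 | 9 | < 0.1% |
| 26 | 20 | < 0.1% |
| 25 | 24 | < 0.1% |
| 24 | 49 | < 0.1% |
| 23 | 77 | < 0.1% |
| 22 | 136 | 0.1% |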

AST
Real number (ℝ≥0)

ZEROS

Distinct: 26
Distinct (%): < 0.1%
Missing: 0
Missing (%): 0.0%
Infinite: 0
Infinite (%): 0.0%
Mean: 3.057528786
Minimum: 0
Maximum: 25
Zeros: 38786
Zeros (%): 16.4%
Negative: 0
Negative (%): 0.0%
Memory size: 1.8 MiB

Quantile statistics

Minimum: 0
5-th percentile: 0
Q1: 1
median: 2
Q3: 4
95-th percentile: 9
Maximum: 25
Range: 25
Interquartile range (IQR): 3

Descriptive statistics

Standard deviation: 2.882145369
Coefficient of variation (CV): 0.9426388339
Kurtosis: 2.489878199
Mean: 3.057528786
Median Absolute Deviation (MAD): 2
Skewness: 1.435215185
Sum: 720950
Variance: 8.306761928
Monotonicity: Not monotonic
Histogram with fixed size bins (bins=26)
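| Value | Count | Frequency (%) |
| 1 | 46525 | 19.7% |
| 2 | 40592 | 17.2% |
| 0 | 38786 | 16.4% |
| 3 | 30708 | 13.0% |
| 4 | 22383 | 9.5% |
| 5 | 16201 | 6.9% |
| 6 | 12022 | 5.1% |
| 7 | 8741 | 3.7% |
| 8 | 6206 | 2.6% |
| 9 | 4381 | 1.9% |
| Other values (16) | 9250 | 3.9% |

| Value | Count | Frequency (%) |
| 0 | 38786 | 16.4% |
| 1 | 46525 | 19.7% |
| 2 | 40592 | 17.2% |
| 3 | 30708 | 13.0% |
| 4 | 22383 | 9.5% |
| 5 | 16201 | 6.9% |
| 6 | 12022 | 5.1% |
| 7 | 8741 | 3.7% |
| 8 | 6206 | 2.6% |
| 9 | 4381 | 1.9% |

| Value | Count | Frequency (%) |
| 25 | 1 | < 0.1% |
| 24 | 4 | < 0.1% |
| 23 | 2 | < 0.1% |
| 22 | 3 | < 0.1% |
| 21 | 11 | < 0.1% |
| 20 | 30 | < 0.1% |
| 19 | 41 | < 0.1% |
| 18 | 56 | < 0.1% |
| 17 | 137 | 0.1% |
| 16 | 217 | 0.1% |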

STL
Real number (ℝ≥0)

ZEROS

Distinct: 11
Distinct (%): < 0.1%
Missing: 0
Missing (%): 0.0%
Infinite: 0
Infinite (%): 0.0%
Mean: 0.9782480545
Minimum: 0
Maximum: 10
Zeros: 97779
Zeros (%): 41.5%
Negative: 0
Negative (%): 0.0%
Memory size: 1.8 MiB

Quantile statistics

Minimum: 0
5-th percentile: 0
Q1: 0
median: 1
Q3: 2
95-th percentile: 3
Maximum: 10
Range: 10
Interquartile range (IQR): 2

Descriptive statistics

Standard deviation: 1.095574094
Coefficient of variation (CV): 1.119934856
Kurtosis: 2.15287415
Mean: 0.9782480545
Median Absolute Deviation (MAD): 1
Skewness: 1.3174096
Sum: 230666
Variance: 1.200282595
Monotonicity: Not monotonic
Histogram with fixed size bins (bins=11)
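| Value | Count | Frequency (%) |
| 0 | 97779 | 41.5% |
| 1 | 77586 | 32.9% |
| 2 | 38376 | 16.3% |
| 3 | 14804 | 6.3% |
| 4 | 5110 | 2.2% |
| 5 | 1567 | 0.7% |
| 6 | 425 | 0.2% |
| 7 | 105 | < 0.1% |
| 8 | 34 | < 0.1% |
| 9 | 6 | < 0.1% |

| Value | Count | Frequency (%) |
| 0 | 97779 | 41.5% |
| 1 | 77586 | 32.9% |
| 2 | 38376 | 16.3% |
| 3 | 14804 | 6.3% |
| 4 | 5110 | 2.2% |
| 5 | 1567 | 0.7% |
| 6 | 425 | 0.2% |
| 7 | 105 | < 0.1% |
| 8 | 34 | < 0.1% |
| 9 | 6 | < 0.1% |

| Value | Count | Frequency (%) |
| 10 | 3 | < 0.1% |
| 9 | 6 | < 0.1% |
| 8 | 34 | < 0.1% |
| 7 | 105 | < 0.1% |
| 6 | 425 | 0.2% |
| 5 | 1567 | 0.7% |
| 4 | 5110 | 2.2% |
| 3 | 14804 | 6.3% |
| 2 | 38376 | 16.3% |
| 1 | 77586 | 32.9% |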

BLK
Real number (ℝ≥0)

ZEROS

Distinct: 13
Distinct (%): < 0.1%
Missing: 0
Missing (%): 0.0%
Infinite: 0
Infinite (%): 0.0%
Mean: 0.6370957824
Minimum: 0
Maximum: 12
Zeros: 143602
Zeros (%): 60.9%
Negative: 0
Negative (%): 0.0%
Memory size: 1.8 MiB

Quantile statistics

Minimum: 0
5-th percentile: 0
Q1: 0
median: 0
Q3: 1
95-th percentile: 3
Maximum: 12
Range: 12
Interquartile range (IQR): 1

Descriptive statistics

Standard deviation: 1.021306481
Coefficient of variation (CV): 1.603065833
Kurtosis: 6.972941411
Mean: 0.6370957824
Median Absolute Deviation (MAD): 0
Skewness: 2.241525791
Sum: 150224
Variance: 1.043066928
Monotonicity: Not monotonic
Histogram with fixed size bins (bins=13)
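| Value | Count | Frequency (%) |
| 0 | 143602 | 60.9% |
| 1 | 57071 | 24.2% |
| 2 | 21291 | 9.0% |
| 3 | 8312 | 3.5% |
| 4 | 3277 | 1.4% |
| 5 | 1433 | 0.6% |
| 6 | 498 | 0.2% |
| 7 | 195 | 0.1% |
| 8 | 63 | < 0.1% |
| 9 | 32 | < 0.1% |
| Other values (3) | 21 | < 0.1% |

| Value | Count | Frequency (%) |
| 0 | 143602 | 60.9% |
| 1 | 57071 | 24.2% |
| 2 | 21291 | 9.0% |
| 3 | 8312 | 3.5% |
| 4 | 3277 | 1.4% |
| 5 | 1433 | 0.6% |
| 6 | 498 | 0.2% |
| 7 | 195 | 0.1% |
| 8 | 63 | < 0.1% |
| 9 | 32 | < 0.1% |

| Value | Count | Frequency (%) |
| 12 | 1 | < 0.1% |
| 11 | 5 | < 0.1% |
| 10 | 15 | < 0.1% |
| 9 | 32 | < 0.1% |
| 8 | 63 | < 0.1% |
| 7 | 195 | 0.1% |
| 6 | 498 | 0.2% |
| 5 | 1433 | 0.6% |
| 4 | 3277 | 1.4% |
| 3 | 8312 | 3.5% |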

TO
Real number (ℝ≥0)

ZEROS

Distinct: 13
Distinct (%): < 0.1%
Missing: 0
Missing (%): 0.0%
Infinite: 0
Infinite (%): 0.0%
Mean: 1.838448652
Minimum: 0
Maximum: 12
Zeros: 48402
Zeros (%): 20.5%
Negative: 0
Negative (%): 0.0%
Memory size: 1.8 MiB

Quantile statistics

Minimum: 0
5-th percentile: 0
Q1: 1
median: 2
Q3: 3
95-th percentile: 5
Maximum: 12
Range: 12
Interquartile range (IQR): 2

Descriptive statistics

Standard deviation: 1.558167613
Coefficient of variation (CV): 0.8475448095
Kurtosis: 1.223951956
Mean: 1.838448652
Median Absolute Deviation (MAD): 1
Skewness: 1.014007183
Sum: 433497
Variance: 2.42788631
Monotonicity: Not monotonic
Histogram with fixed size bins (bins=13)
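| Value | Count | Frequency (%) |
| 1 | 66176 | 28.1% |
| 2 | 53961 | 22.9% |
| 0 | 48402 | 20.5% |
| 3 | 34022 | 14.4% |
| 4 | 18468 | 7.8% |
| 5 | 8712 | 3.7% |
| 6 | 3681 | 1.6% |
| 7 | 1560 | 0.7% |
| 8 | 534 | 0.2% |
| 9 | 200 | 0.1% |
| Other values (3) | 79 | < 0.1% |

| Value | Count | Frequency (%) |
| 0 | 48402 | 20.5% |
| 1 | 66176 | 28.1% |
| 2 | 53961 | 22.9% |
| 3 | 34022 | 14.4% |
| 4 | 18468 | 7.8% |
| 5 | 8712 | 3.7% |
| 6 | 3681 | 1.6% |
| 7 | 1560 | 0.7% |
| 8 | 534 | 0.2% |
| 9 | 200 | 0.1% |

| Value | Count | Frequency (%) |
| 12 | 6 | < 0.1% |
| 11 | 21 | < 0.1% |
| 10 | 52 | < 0.1% |
| 9 | 200 | 0.1% |
| 8 | 534 | 0.2% |
| 7 | 1560 | 0.7% |
| 6 | 3681 | 1.6% |
| 5 | 8712 | 3.7% |
| 4 | 18468 | 7.8% |
| 3 | 34022 | 14.4% |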

PF
Real number (ℝ≥0)

ZEROS

Distinct: 7
Distinct (%): < 0.1%
Missing: 0
Missing (%): 0.0%
Infinite: 0
Infinite (%): 0.0%
Mean: 2.452740728
Minimum: 0
Maximum: 6
Zeros: 20991
Zeros (%): 8.9%
Negative: 0
Negative (%): 0.0%
Memory size: 1.8 MiB

Quantile statistics

Minimum: 0
5-th percentile: 0
Q1: 1
median: 2
Q3: 3
95-th percentile: 5
Maximum: 6
Range: 6
Interquartile range (IQR): 2

Descriptive statistics

Standard deviation: 1.467876329
Coefficient of variation (CV): 0.5984637154
Kurtosis: -0.5888790249
Mean: 2.452740728
Median Absolute Deviation (MAD): 1
Skewness: 0.2389961431
Sum: 578344
Variance: 2.154660917
Monotonicity: Not monotonic
Histogram with fixed size bins (bins=7)
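| Value | Count | Frequency (%) |
| 2 | 58318 | 24.7% |
| 3 | 52558 | 22.3% |
| 1 | 46115 | 19.6% |
| 4 | 35728 | 15.2% |
| 0 | 20991 | 8.9% |
| 5 | 17503 | 7.4% |
| 6 | 4582 | 1.9% |

| Value | Count | Frequency (%) |
| 0 | 20991 | 8.9% |
| 1 | 46115 | 19.6% |
| 2 | 58318 | 24.7% |
| 3 | 52558 | 22.3% |
| 4 | 35728 | 15.2% |
| 5 | 17503 | 7.4% |
| 6 | 4582 | 1.9% |

| Value | Count | Frequency (%) |
| 6 | 4582 | 1.9% |
| 5 | 17503 | 7.4% |
| 4 | 35728 | 15.2% |
| 3 | 52558 | 22.3% |
| 2 | 58318 | 24.7% |
| 1 | 46115 | 19.6% |
| 0 | 20991 | 8.9% |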

PTS
Real number (ℝ≥0)

HIGH CORRELATION
ZEROS

Distinct: 66
Distinct (%): < 0.1%
Missing: 0
Missing (%): 0.0%
Infinite: 0
Infinite (%): 0.0%
Mean: 13.87079455
Minimum: 0
Maximum: 81
Zeros: 5577
Zeros (%): 2.4%
Negative: 0
Negative (%): 0.0%
Memory size: 1.8 MiB

Quantile statistics

Minimum: 0
5-th percentile: 2
Q1: 8
median: 13
Q3: 19
95-th percentile: 29
Maximum: 81
Range: 81
Interquartile range (IQR): 11

Descriptive statistics

Standard deviation: 8.372313464
Coefficient of variation (CV): 0.6035929259
Kurtosis: 0.5860038519
Mean: 13.87079455
Median Absolute Deviation (MAD): 6
Skewness: 0.7282081689
Sum: 3270664
Variance: 70.09563274
Monotonicity: Not monotonic
Histogram with fixed size bins (bins=50)
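| Value | Count | Frequency (%) |
| 8 | 12526 | 5.3% |
| 10 | 12496 | 5.3% |
| 12 | 11824 | 5.0% |
| 6 | 11667 | 4.9% |
| 14 | 10928 | 4.6% |
| 11 | 10729 | 4.6% |
| 13 | 10387 | 4.4% |
| 4 | 10002 | 4.2% |
| 9 | 9848 | 4.2% |
| 15 | 9757 | 4.1% |
| Other values (56) | 125631 | 53.3% |

| Value | Count | Frequency (%) |
| 0 | 5577 | 2.4% |
| 1 | 977 | 0.4% |
| 2 | 8592 | 3.6% |
| 3 | 4680 | 2.0% |
| 4 | 10002 | 4.2% |
| 5 | 7229 | 3.1% |
| 6 | 11667 | 4.9% |
| 7 | 8620 | 3.7% |
| 8 | 12526 | 5.3% |
| 9 | 9848 | 4.2% |

| Value | Count | Frequency (%) |
| 81 | 1 | < 0.1% |
| 70 | 1 | < 0.1% |
| 65 | 1 | < 0.1% |
| 62 | 4 | < 0.1% |
| 61 | 6 | < 0.1% |
| 60 | 11 | < 0.1% |
| 59 | 4 | < 0.1% |
| 58 | 4 | < 0.1% |
| 57 | 9 | < 0.1% |
| 56 | 5 | < 0.1% |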

PLUS_MINUS
Real number (ℝ)

ZEROS

Distinct: 104
Distinct (%): < 0.1%
Missing: 0
Missing (%): 0.0%
Infinite: 0
Infinite (%): 0.0%
Mean: 0.3684047584
Minimum: -57
Maximum: 54
Zeros: 7488
Zeros (%): 3.2%
Negative: 113514
Negative (%): 48.1%
Memory size: 1.8 MiB

Quantile statistics

Minimum: -57
5-th percentile: -19
Q1: -8
median: 0
Q3: 9
95-th percentile: 21
Maximum: 54
Range: 111
Interquartile range (IQR): 17

Descriptive statistics

Standard deviation: 12.29651244
Coefficient of variation (CV): 33.37772425
Kurtosis: -0.09377209286
Mean: 0.3684047584
Median Absolute Deviation (MAD): 8
Skewness: 0.08613220854
Sum: 86868
Variance: 151.2042182
Monotonicity: Not monotonic
Histogram with fixed size bins (bins=50)
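| Value | Count | Frequency (%) |
| -2 | 7518 | 3.2% |
| 0 | 7488 | 3.2% |
| -1 | 7485 | 3.2% |
| -3 | 7388 | 3.1% |
| -4 | 7384 | 3.1% |
| 1 | 7365 | 3.1% |
| 2 | 7317 | 3.1% |
| 3 | 7221 | 3.1% |
| 4 | 7095 | 3.0% |
| -5 | 6988 | 3.0% |
| Other values (94) | 162546 | 68.9% |

| Value | Count | Frequency (%) |
| -57 | 1 | < 0.1% |
| -51 | 1 | < 0.1% |
| -50 | 1 | < 0.1% |
| -48 | 3 | < 0.1% |
| -47 | 5 | < 0.1% |
| -46 | 6 | < 0.1% |
| -45 | 10 | < 0.1% |
| -44 | 9 | < 0.1% |
| -43 | 13 | < 0.1% |
| -42 | 18 | < 0.1% |

| Value | Count | Frequency (%) |
| 54 | 1 | < 0.1% |
| 51 | 1 | < 0.1% |
| 50 | 2 | < 0.1% |
| 49 | 4 | < 0.1% |
| 48 | 3 | < 0.1% |
| 47 | 6 | < 0.1% |
| 46 | 11 | < 0.1% |
| 45 | 12 | < 0.1% |
| 44 | 15 | < 0.1% |
| 43 | 18 | < 0.1% |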

Interactions

Correlations

Pearson's r

Pearson's correlation coefficient (r) is a measure of linear correlation between two variables. Its value lies between -1 and +1, with -1 indicating total negative linear correlation, 0 indicating no linear correlation, and +1 indicating total positive linear correlation. Furthermore, r is invariant under separate changes in location and scale of the two variables, which implies that, for a linear relationship, the steepness of the line (its angle to the x-axis) does not affect r.

To calculate r for two variables X and Y, one divides the covariance of X and Y by the product of their standard deviations.
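
As a minimal sketch of that calculation (not the report tool's own implementation), the function below divides the sample covariance by the product of the sample standard deviations. The inputs are the FGM and PTS values from the ten "First rows" of the sample; the function name `pearson_r` is illustrative.

```python
import math


def pearson_r(x, y):
    """Pearson's r: covariance of x and y over the product of their standard deviations."""
    n = len(x)
    mean_x, mean_y = sum(x) / n, sum(y) / n
    cov = sum((a - mean_x) * (b - mean_y) for a, b in zip(x, y)) / (n - 1)
    std_x = math.sqrt(sum((a - mean_x) ** 2 for a in x) / (n - 1))
    std_y = math.sqrt(sum((b - mean_y) ** 2 for b in y) / (n - 1))
    return cov / (std_x * std_y)


# FGM and PTS for the ten rows shown under "First rows".
fgm = [2.0, 1.0, 3.0, 2.0, 0.0, 4.0, 2.0, 1.0, 0.0, 0.0]
pts = [4.0, 2.0, 9.0, 6.0, 2.0, 8.0, 5.0, 2.0, 4.0, 0.0]

r = pearson_r(fgm, pts)
# Shifting and rescaling one variable (positive scale) leaves r unchanged.
r_rescaled = pearson_r([3 * a + 10 for a in fgm], pts)
print(r, r_rescaled)
```

Ten rows are far too few to reproduce the correlations in the report; the example only illustrates the formula and its invariance to location and scale.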

Spearman's ρ

Spearman's rank correlation coefficient (ρ) is a measure of monotonic correlation between two variables, and is therefore better than Pearson's r at capturing nonlinear monotonic correlations. Its value lies between -1 and +1, with -1 indicating total negative monotonic correlation, 0 indicating no monotonic correlation, and +1 indicating total positive monotonic correlation.

To calculate ρ for two variables X and Y, one divides the covariance of the rank variables of X and Y by the product of their standard deviations.
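
A minimal sketch of this procedure, assuming ties receive the average of the ranks they span (a common convention, not confirmed by the report): rank both variables, then apply Pearson's formula to the ranks. The helper names are illustrative.

```python
import math


def average_ranks(values):
    """Rank values from 1..n, giving tied values the average of their ranks."""
    order = sorted(range(len(values)), key=lambda i: values[i])
    ranks = [0.0] * len(values)
    i = 0
    while i < len(order):
        j = i
        # Extend j to cover the whole run of tied values.
        while j + 1 < len(order) and values[order[j + 1]] == values[order[i]]:
            j += 1
        avg_rank = (i + j) / 2 + 1
        for k in range(i, j + 1):
            ranks[order[k]] = avg_rank
        i = j + 1
    return ranks


def pearson_r(x, y):
    n = len(x)
    mean_x, mean_y = sum(x) / n, sum(y) / n
    cov = sum((a - mean_x) * (b - mean_y) for a, b in zip(x, y))
    var_x = sum((a - mean_x) ** 2 for a in x)
    var_y = sum((b - mean_y) ** 2 for b in y)
    return cov / math.sqrt(var_x * var_y)


def spearman_rho(x, y):
    """Spearman's rho: Pearson's r computed on the rank variables."""
    return pearson_r(average_ranks(x), average_ranks(y))


# A monotonic but nonlinear relationship: y = x cubed.
x = [1, 2, 3, 4, 5]
y = [a ** 3 for a in x]
print(spearman_rho(x, y), pearson_r(x, y))
```

For the cubic example the ranks of x and y coincide, so ρ is 1 even though Pearson's r falls below 1.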

Kendall's τ

Similarly to Spearman's rank correlation coefficient, the Kendall rank correlation coefficient (τ) measures ordinal association between two variables. Its value lies between -1 and +1, with -1 indicating total negative correlation, 0 indicating no correlation, and +1 indicating total positive correlation.

To calculate τ for two variables X and Y, one determines the number of concordant and discordant pairs of observations. τ is given by the number of concordant pairs minus the number of discordant pairs, divided by the total number of pairs.
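
The pair-counting definition can be sketched directly. This is the plain form described above (often called tau-a); library implementations commonly apply a tie correction instead, so results on tied data such as box-score counts may differ. The function name is illustrative.

```python
from itertools import combinations


def kendall_tau_a(x, y):
    """(concordant - discordant) / total number of pairs, with no tie correction."""
    pairs = list(combinations(range(len(x)), 2))
    concordant = discordant = 0
    for i, j in pairs:
        direction = (x[i] - x[j]) * (y[i] - y[j])
        if direction > 0:
            concordant += 1  # both variables order the pair the same way
        elif direction < 0:
            discordant += 1  # the variables order the pair oppositely
        # direction == 0 means a tie in x or y; it counts toward neither total
    return (concordant - discordant) / len(pairs)


# Six pairs in total: five concordant, one discordant (the swapped 3 and 2).
tau = kendall_tau_a([1, 2, 3, 4], [1, 3, 2, 4])
print(tau)
```

This direct version compares every pair and therefore runs in quadratic time, which is fine for illustration but slow for a dataset of this size.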

Phik (φk)

Phik (φk) is a new and practical correlation coefficient that works consistently between categorical, ordinal and interval variables, captures non-linear dependency and reverts to the Pearson correlation coefficient in case of a bivariate normal input distribution. There is extensive documentation available here.

Cramér's V (φc)

Cramér's V is an association measure for nominal random variables. The coefficient ranges from 0 to 1, with 0 indicating independence and 1 indicating perfect association. The empirical estimators used for Cramér's V have been shown to be biased, even for large samples. We use a bias-corrected measure proposed by Bergsma in 2013, which can be found here.
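
A minimal sketch of a bias-corrected Cramér's V, following the correction commonly attributed to Bergsma (2013); whether the report computes it in exactly this way is an assumption. The contingency tables are hypothetical counts chosen for illustration, and the function name is illustrative.

```python
import math


def cramers_v_corrected(table):
    """Bias-corrected Cramér's V for a contingency table given as a list of rows."""
    n = sum(sum(row) for row in table)
    r, k = len(table), len(table[0])
    row_totals = [sum(row) for row in table]
    col_totals = [sum(table[i][j] for i in range(r)) for j in range(k)]

    # Pearson chi-squared statistic against the independence expectation.
    chi2 = 0.0
    for i in range(r):
        for j in range(k):
            expected = row_totals[i] * col_totals[j] / n
            if expected > 0:
                chi2 += (table[i][j] - expected) ** 2 / expected

    phi2 = chi2 / n
    # Bias correction: shrink phi2 and the effective table dimensions.
    phi2_corr = max(0.0, phi2 - (k - 1) * (r - 1) / (n - 1))
    r_corr = r - (r - 1) ** 2 / (n - 1)
    k_corr = k - (k - 1) ** 2 / (n - 1)
    denom = min(k_corr - 1, r_corr - 1)
    return math.sqrt(phi2_corr / denom) if denom > 0 else 0.0


# Hypothetical 2x2 tables: independent categories and perfectly associated ones.
independent = [[25, 25], [25, 25]]
perfect = [[50, 0], [0, 50]]
print(cramers_v_corrected(independent), cramers_v_corrected(perfect))
```

The `max(0.0, ...)` clamp is what keeps the corrected estimate at 0 for independent data, where the uncorrected statistic would otherwise be pushed slightly negative by the correction term.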

Missing values

A simple visualization of nullity by column.
The nullity matrix is a data-dense display that lets you quickly pick out patterns in data completion.

Sample

First rows

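| | df_index | GAME_ID | TEAM_ABBREVIATION | TEAM_CITY | PLAYER_NAME | START_POSITION | MIN | FGM | FGA | FG_PCT | FG3M | FG3A | FG3_PCT | FTM | FTA | FT_PCT | OREB | DREB | REB | AST | STL | BLK | TO | PF | PTS | PLUS_MINUS |
| 0 | 543022 | 11400001 | MIA | Miami | Luol Deng | F | 24:50 | 2.0 | 7.0 | 0.286 | 0.0 | 1.0 | 0.000 | 0.0 | 0.0 | 0.000 | 2.0 | 1.0 | 3.0 | 1.0 | 2.0 | 0.0 | 0.0 | 0.0 | 4.0 | -16.0 |
| 1 | 543023 | 11400001 | MIA | Miami | Udonis Haslem | F | 10:32 | 1.0 | 2.0 | 0.500 | 0.0 | 1.0 | 0.000 | 0.0 | 0.0 | 0.000 | 0.0 | 2.0 | 2.0 | 1.0 | 1.0 | 0.0 | 0.0 | 2.0 | 2.0 | -8.0 |
| 2 | 543024 | 11400001 | MIA | Miami | Chris Bosh | C | 25:20 | 3.0 | 13.0 | 0.231 | 1.0 | 3.0 | 0.333 | 2.0 | 3.0 | 0.667 | 1.0 | 5.0 | 6.0 | 2.0 | 1.0 | 0.0 | 4.0 | 2.0 | 9.0 | -10.0 |
| 3 | 543025 | 11400001 | MIA | Miami | Dwyane Wade | G | 20:31 | 2.0 | 7.0 | 0.286 | 0.0 | 1.0 | 0.000 | 2.0 | 4.0 | 0.500 | 0.0 | 2.0 | 2.0 | 3.0 | 0.0 | 0.0 | 1.0 | 1.0 | 6.0 | -11.0 |
| 4 | 543026 | 11400001 | MIA | Miami | Mario Chalmers | G | 21:16 | 0.0 | 2.0 | 0.000 | 0.0 | 1.0 | 0.000 | 2.0 | 2.0 | 1.000 | 0.0 | 3.0 | 3.0 | 2.0 | 1.0 | 0.0 | 2.0 | 1.0 | 2.0 | -6.0 |
| 5 | 543003 | 11400001 | NOP | New Orleans | Darius Miller | F | 32:23 | 4.0 | 12.0 | 0.333 | 0.0 | 5.0 | 0.000 | 0.0 | 0.0 | 0.000 | 0.0 | 3.0 | 3.0 | 2.0 | 1.0 | 0.0 | 1.0 | 3.0 | 8.0 | 9.0 |
| 6 | 543004 | 11400001 | NOP | New Orleans | Anthony Davis | F | 11:20 | 2.0 | 4.0 | 0.500 | 0.0 | 0.0 | 0.000 | 1.0 | 2.0 | 0.500 | 1.0 | 3.0 | 4.0 | 0.0 | 0.0 | 3.0 | 1.0 | 0.0 | 5.0 | 0.0 |
| 7 | 543005 | 11400001 | NOP | New Orleans | Omer Asik | C | 6:04 | 1.0 | 2.0 | 0.500 | 0.0 | 0.0 | 0.000 | 0.0 | 0.0 | 0.000 | 0.0 | 3.0 | 3.0 | 1.0 | 0.0 | 1.0 | 1.0 | 2.0 | 2.0 | 3.0 |
| 8 | 543006 | 11400001 | NOP | New Orleans | Eric Gordon | G | 10:07 | 0.0 | 4.0 | 0.000 | 0.0 | 2.0 | 0.000 | 4.0 | 4.0 | 1.000 | 0.0 | 1.0 | 1.0 | 2.0 | 0.0 | 0.0 | 1.0 | 0.0 | 4.0 | 4.0 |
| 9 | 543007 | 11400001 | NOP | New Orleans | Jrue Holiday | G | 10:07 | 0.0 | 2.0 | 0.000 | 0.0 | 1.0 | 0.000 | 0.0 | 0.0 | 0.000 | 0.0 | 2.0 | 2.0 | 0.0 | 0.0 | 0.0 | 1.0 | 2.0 | 0.0 | 4.0 |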

Last rows

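| | df_index | GAME_ID | TEAM_ABBREVIATION | TEAM_CITY | PLAYER_NAME | START_POSITION | MIN | FGM | FGA | FG_PCT | FG3M | FG3A | FG3_PCT | FTM | FTA | FT_PCT | OREB | DREB | REB | AST | STL | BLK | TO | PF | PTS | PLUS_MINUS |
| 235785 | 491 | 52000211 | GSW | Golden State | Andrew Wiggins | F | 44:18 | 10.0 | 22.0 | 0.455 | 1.0 | 4.0 | 0.25 | 1.0 | 1.0 | 1.000 | 1.0 | 9.0 | 10.0 | 2.0 | 0.0 | 2.0 | 2.0 | 4.0 | 22.0 | -8.0 |
| 235786 | 492 | 52000211 | GSW | Golden State | Draymond Green | F | 45:05 | 5.0 | 11.0 | 0.455 | 1.0 | 1.0 | 1.00 | 0.0 | 0.0 | 0.000 | 1.0 | 15.0 | 16.0 | 10.0 | 1.0 | 1.0 | 6.0 | 3.0 | 11.0 | -2.0 |
| 235787 | 493 | 52000211 | GSW | Golden State | Kevon Looney | C | 24:27 | 1.0 | 1.0 | 1.000 | 0.0 | 0.0 | 0.00 | 1.0 | 2.0 | 0.500 | 3.0 | 1.0 | 4.0 | 3.0 | 1.0 | 1.0 | 1.0 | 3.0 | 3.0 | -15.0 |
| 235788 | 494 | 52000211 | GSW | Golden State | Kent Bazemore | G | 25:30 | 4.0 | 12.0 | 0.333 | 1.0 | 5.0 | 0.20 | 1.0 | 4.0 | 0.250 | 1.0 | 3.0 | 4.0 | 2.0 | 2.0 | 1.0 | 0.0 | 2.0 | 10.0 | -10.0 |
| 235789 | 495 | 52000211 | GSW | Golden State | Stephen Curry | G | 47:23 | 13.0 | 28.0 | 0.464 | 6.0 | 15.0 | 0.40 | 7.0 | 7.0 | 1.000 | 1.0 | 3.0 | 4.0 | 5.0 | 3.0 | 0.0 | 7.0 | 5.0 | 39.0 | 4.0 |
| 235790 | 476 | 52000211 | MEM | Memphis | Kyle Anderson | F | 39:01 | 2.0 | 7.0 | 0.286 | 0.0 | 1.0 | 0.00 | 5.0 | 5.0 | 1.000 | 2.0 | 8.0 | 10.0 | 6.0 | 1.0 | 2.0 | 0.0 | 1.0 | 9.0 | 11.0 |
| 235791 | 477 | 52000211 | MEM | Memphis | Jaren Jackson Jr. | F | 14:56 | 1.0 | 6.0 | 0.167 | 1.0 | 4.0 | 0.25 | 7.0 | 8.0 | 0.875 | 2.0 | 0.0 | 2.0 | 0.0 | 0.0 | 2.0 | 1.0 | 4.0 | 10.0 | 1.0 |
| 235792 | 478 | 52000211 | MEM | Memphis | Jonas Valanciunas | C | 25:33 | 3.0 | 6.0 | 0.500 | 1.0 | 1.0 | 1.00 | 2.0 | 4.0 | 0.500 | 6.0 | 6.0 | 12.0 | 3.0 | 0.0 | 0.0 | 3.0 | 6.0 | 9.0 | 7.0 |
| 235793 | 479 | 52000211 | MEM | Memphis | Dillon Brooks | G | 45:04 | 7.0 | 22.0 | 0.318 | 0.0 | 4.0 | 0.00 | 0.0 | 0.0 | 0.000 | 1.0 | 1.0 | 2.0 | 3.0 | 2.0 | 0.0 | 1.0 | 5.0 | 14.0 | -1.0 |
| 235794 | 480 | 52000211 | MEM | Memphis | Ja Morant | G | 45:45 | 14.0 | 29.0 | 0.483 | 5.0 | 10.0 | 0.50 | 2.0 | 2.0 | 1.000 | 2.0 | 4.0 | 6.0 | 6.0 | 4.0 | 0.0 | 5.0 | 2.0 | 35.0 | -1.0 |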